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Abstract 

We consider the ensemble of real Ginibre matrices conditioned to have positive fraction a > 0 
of real eigenvalues. We demonstrate a large deviations principle for the joint eigenvalue density of 
such matrices and introduce a two phase log-gas whose stationary distribution coincides with the 
spectral measure of the ensemble. Using these tools we provide an asymptotic expansion for the 
probability that an n x n Ginibre matrix has k = an real eigenvalues and we characterize the 
spectral irreasures of these matrices. 


1 Introduction 

Random matrices constitute a central topic in modern probability theory (l}|^ and an important tool 
for an increasing number of applications, from physics to biology or engineering |9- 

The present study deals with specific properties of a canonical family of random matrices, the 
real Ginibre ensemble. Under a reference probability P, the entries of such matrices are i.i.d. normal 
random variables with variance J where n is the matrix size. In particular we intend to look deeper 
into the properties of real Ginibre matrices with anomalously large number of real eigenvalues, which 
are still largely unknown. These constraints are drastic for the random matrices, and affect the shape 
of the distribution of the eigenvalues as illustrated in Fig. We aim at characterizing the eigenvalue 
distribution of random matrices when the number of real eigenvalues k is proportional to the matrix 
size n. 

Our work reveals that in this regime, the empirical spectral measure (ESD) markedly departs from 
the one of the unconditioned ensemble. We establish that when n —)• oo, the ESD converges to a 
limit that is supported by both the real line and the complex plane. We characterize the macroscopic 
properties of this limit and analyze its microscopic organization. In the process, we also obtain an 
estimate for the probability that an n x n matrix has k real eigenvalues with k = 0{n) as n —)• oo. 

*CNRS UMR 7241, INSERM U1050, UPMC ED 158, MEMOLIFE PSL* 
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Figure 1: Top: Superposition of 50 spectra of 50 x 50 Ginibre matrices conditioned to have 26 real 
eigenvalues (a) and unconditioned (b). Black lines correspond to the unit circle and red lines correspond 
to our estimation of the support of the complex part of the spectrum in the large n limit. Bottom: 
Histogram of the real eigenvalues of 200 50 x 50 Ginibre matrices conditioned to have 26 real eigenvalues 
(c) and unconditioned (d). Histograms are normalized with respect to n. Red line corresponds to our 
estimation of the distribution of real eigenvalues in the large n limit. 


Literature review. Before presenting our main results, we concisely review some relevant past 
results in random matrix theory. Gharacterizing the spectral properties of the real Ginibre ensemble 
has been an active held of research. A milestone in this direction was the computation, hrst in [I^ 
and later in [I^ , of the joint probability distribution of the eigenvalues Ai,..., A„ of n x n real Ginibre 
matrices: 

i,--- ,A„] =C'n]J|Ai-AjI I ]Jexp(-nAi)erfc 

i>j \i=l 



where the asterisk denotes the complex conjugate and Cn is the normalization constant. We denote in 
the sequel by the associated probability measure (see equation (26)). 

A later breakthrough was the characterization of the correlations between eigenvalues in terms of 
Pfaffian processes in a series of studies [l4-20 . More recently, this picture has been further augmented 


by the description of the distribution of the spectral radius and of the largest real eigenvalue of this 
ensemble of matrices |21] . 

Much work has also been devoted to the characterization of the spectrum of real Ginibre matrices 
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in the limit of n —>• cx). We denote by //"■ the empirical spectral distribution of such matrices of size 
n X n defined as: 

1 

P) 

i=\ 

where are the eigenvalues of the matrix. For M” a real Ginibre matrix scaled by l/-v/n, it is 

now well known that pT' converges to the uniform distribution on the unit disk as n —?• oo, a result 
known as circular law: 

A"" ^ “l|x|<i • (3) 

vr 

For real Ginibre matrices, this result was first demonstrated in |13| . It has been now established that 
the circular law is universal in the sense that convergence of A”" to the uniform distribution on the 
unit disk holds for matrices composed of i.i.d. random variables with zero mean and 1/n variance |22] . 
Furthermore, local properties, such as correlations between eigenvalues are also universal in the n —>■ oo 
limit for matrices with independent elements with exponentially decaying distribution and moments 
matching the normal distribution up to fourth order |23] . 

Large Deviations Principles (LDP) for random matrices were derived in [^[^ , where it was shown 
that the sequence of empirical measures of satisfy a LDP with speed and rate function 


X[/i]: 


(4) 


M !-)> M 

^3 if - I f^og\x- y\dfi{x)dn{y)) - K 

for symmetric Gaussian ensembles (i.e. Gaussian Orthogonal Ensemble) and the real Ginibre ensemble 
respectively. In the former, the map I acts on probability measures on the real axis W1(''(M) and 
iP = I + j log 2. In the later, the map I acts on probability measures on C symmetrical with respect 
to complex conjugation: Wif (C), and iP = |. 

The distinctive feature of the spectrum of real Ginibre matrices is that it has a non-zero probability 


of having real eigenvalues. As shown in 26 the empirical spectral distribution of finite real 
Ginibre matrices has a singularity on the real line because there is a positive probability of having 
real eigenvalues. As the matrix size n goes to infinity, this singularity disappears because the expected 
number of real eigenvalues is of order ^/n. The first numerical reports on this scaling appeared in 
and a rigorous proof for the average number of real eigenvalues and higher order expansions in 
Its universality was established in (^ . Recent studies have provided a more detailed analysis of the 
distribution of real eigenvalues of real Ginibre matrices, notably the inter eigenvalue gap distribution 
which takes on an approximately semi-Poisson form in the bulk. 

the author introduced the probabilities for a real Ginibre matrix of size n to have k real 
eigenvalues and provided numerical estimates for some special cases of and exact expressions for the 
specific case k = n. Integral expressions for these probabilities for small n were derived |32|, enlarging 


29-31 


In 


13 


the range of values that could be numerically computed. An exact expression for k^ = n—2 and its large 
n asymptotic behavior was derived in using the integrable structure of the real Ginibre ensemble 
and Pfaflian properties. Recently, fine asymptotic estimates of for k small {k = o{\.og{n)/y/n)) were 
analytically derived in |34| . To our knowledge, there is no asymptotic expression of p^ for large n and 
general k. 

Another very efficient method for the study of the spectra of the Gaussian /3 ensembles was proposed 
This pioneering work made a deep analogy between Id log-gases, i.e. freely moving charged par- 


m 


35 


tides with quadratic confinement and logarithmic repulsion, and the eigenvalues of matrix-valued sym¬ 
metric real Ornstein-Uhlenbeck processes. In detail, the equilibrium distribution of a one-dimensional 
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log-gas at an inverse temperature (5 is precisely the distribution of the eigenvalues of the Gaussian /3 
ensemble (symmetric, hermitian or quaternionic random matrices), and the equilibrium density of a 
two-dimensional log-gas is identical to the distribution of eigenvalues of the complex Ginibre ensemble. 
This link between interacting particle systems and spectra of random matrices has proved an essential 
tool to demonstrate properties of the spectrum of random matrices even in cases with extremely low 
probability. As an example, the use of the log-gas for symmetric matrices was instrumental in the 
characterization the spectrum of random matrices with anomalous densities 36 37 , with applications 
to data analysis. 

From the mathematical viewpoint, the existence and uniqueness of solutions to Id log-gas systems 
as well as the convergence as the system size goes to infinity were proved in |38] and for a more general 
class of gases in |39]. For more on log-gases we refer to [^. 


Methods and summary of the main results. Our methods rely on the derivation of a LDP for 
the Ginibre ensemble conditioned on the proportion of real eigenvalues a = kjn. Our contribution 


here is to extend the LDP in l24 25 to the situation that interpolates between the two cases presented 


to allow for measures that are supported both on the real axis and the complex plane. We find that 
such matrices satisfy a LDP with rate and rate function X, where AT = |. 

From our LDP, we are able to show that when k/n —)• a and n —>■ oo, \ logp^ scales as T[iJLa\ — K 
where /Xq, is the minimizer of the rate function X on the set Mq = {zz e A4f (C) ; z/(M) = a}. In 
particular, for the case a = 1 one can see that 


^logp: 


-X[/xi] = - ^log2 , 


which coincides asymptotically with the exact formula derived in 13 and with the formula of p ”_2 
derived in 33 ; actually, our result shows that this logarithmic equivalent is valid for Pn- 2 r 
r E N and not only for r = 0 or 1. However, obtaining a closed form expression for the minimizer 
is not straightforward. Nonetheless, we are able to derive a precise qualitative picture of the support 
and shape of the minimizer through the use of a constrained optimization problem |40] . 

To gain a deeper understanding on the minimizer, we next introduce and investigate the log-gas 
whose stationary distribution corresponds to the eigenvalue distribution of the class of matrices we 
are studying. In contrast with the existing literature, this log-gas is neither one nor two dimensional: 
it is a mixture, in the complex plane, of the two types of gas, one fraction of the particles being 
confined on a singular region of the plane. We use this gas to obtain numerically for various values of 
a, approximations of the distribution /x„. 

The above steps characterize the macroscopic properties of the limit distribution /x„. We comple¬ 
ment these by a description of microscopic features. To this end, we use renormalization techniques 
inspired from hydrodynamics and the Ginzburg-Landau theory |M-43 that were extended to inves¬ 


tigate the microscopic organization of particles in log-gases with applications to the distribution of 
eigenvalues of the Ginibre ensemble. This approach has unveiled in particular a crystallization phe¬ 
nomenon in one dimension, and led to conjecture that particles in two-dimensional log-gases organize 
according to a regular triangular lattice in the zero temperature limit 
methods to our mixed-type problem. 


41,42 . We readily apply these 


Organization of the paper. Since the complex eigenvalues of real matrices come in pairs of complex 
conjugates, matrices where k and n have different parity have probability 0. For this reason through 
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the text we assume that the number of real eigenvalues k has the same parity as n. Also we introduce 
the notation cm for the closest integer to an with the same parity as n. 

The article is organized as follows. In section we establish a specific LDP for real Ginibre 
matrices conditioned to have k = cm real eigenvalues and the asymptotic estimation of in section 
We characterize more precisely the form of the distribution of real and complex eigenvalues minimizer 
in section We introduce and analyze in section the Id 2d log-gas whose stationary distribution 
is identical to the eigenvalues of a Ginibre matrix constrained on having a specific number of real 
eigenvalues. Finally, in section we derive the renormalized energy for the mixed gas and discuss its 
implications in terms of the distributions of the particles in the zero temperature limit. 


2 Large deviations principle for k = an and analysis of the rate func¬ 
tion 


Gonsider G with n G N a sequence of real Ginibre random matrices with k^ real eigenvalues, 

such that kn has the same parity as n and k^/n —)• a G [0,1]. The large deviations principles shown 
in ^4 25 correspond to a G {0,1}. We now show that they can be extended to a G (0,1). To this 
purpose, we define Ato as the subset of symmetrical probability measures Atf (C) exactly charging a 
mass a to the real line: 

:= {/r G Mf (C) ; /i(M) = a} 


For fixed a G (0,1) and finite n, the space Ma contains empirical measures of matrices of size n only if 
an is an integer value. Therefore, this space is slightly too small in order to understand the convergence 
properties of the spectrum of sequences of random matrices asymptotically charging non-trivial mass 
on the real axis: the spectral density of matrices of size n can only charge a mass proportional to 1/n 
to the real axis. In order to take into account these fluctuations of the mass on the real axis as n is 
increased, we introduce the decreasing sequence of spaces: 


M^^ = {peMf{C):\p{R)-a\<-}. 

n 

The limit of this sequence is exactly Ada, and for any n G N, the space contains all empirical 
spectral densities of matrices with a number of real eigenvalues k = cSn. Throughout the paper we will 
use the classical Levy topology, which provides a metric for the topology of weak convergenc^ We 
will be interested in sequences of random matrices {M'^)n>o whose spectral density belongs to Ad”. 
These matrices satisfy the following estimates: 


Theorem 1. For any zz G Ado, we have: 


lim limsup ^ logP[/l” G Ad” fl B{i/, J)] < —T[v] 
n —>-00 n 




lim lim inf logP[/l” G Ad” Pi Bin, 5)] > —T\v\ 

_(5\0 n^oo 


where B{i', 6) is the Levy ball of radius 6 centered at v and iL = |. 

^For the sake of completeness, we recall that the distance between two probability measures (/r, c) on C is defined as: 

u) = inf{(5 > 0; n{A) < n{A^) and n{A) < /r(T^) VT G t?(C)} 
where B(C is the Borel algebra of C and for A G A^ = {x; d(x, A) < 5}. 
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This property is more general than a Large Deviation Principle (LDP) in that in allows to take into 
account the constraint on the asymptotic proportion of real eigenvalues, which vary with the sequence 
index. We denote the half complex plane as H = {z G C; 9(z) > 0}. 


Proof. The proof is based on a combination of evaluations and methods proposed in 24 25] in order 


to prove large deviations principles for the Wishart or Ginibre ensembles. The first inequality (upper 
bound) is proved directly by considering the joint eigenvalue density. For the sake of completeness, we 
provide a sketch of the proof in Appendix highlighting the main differences with the one in |25] . 

The lower bound is slightly more complex. In [^, the authors propose an original construction 
of a particular set of points on the real line, from which they construct a measure whose probability 
compares to the rate function and lower bounds the probability we aim at controlling. This construction 
was generalized in |25j where the points now belong to H. For our purposes, a mixed construction both 
on the real line and on El proves necessary to ensure the property stated in the theorem. To this purpose, 
we define I = {n — k)/2 the number of complex eigenvalues in El and the measures i>r{z) = u{z)1z£R 
and vci^) = ^(2^)l^eC\R- The construction, schematically described in Fig proceeds as follows. 
By the continuity properties of the rate function proved in |25j , we can assume that I'c has no atom 
with continuous and everywhere positive density with respect to Lebesgue’s measure on El, and we can 
define a square H in El containing at least (1 — a)(l — 1/n) of the mass. Similarly for i'r on M, one 
can define a bounded interval TZ containing a mass at least equal to «(! — 1/n). The square H (resp. 
the interval TZ) can be decomposed into D disjoint squares ... (resp. 6 disjoint intervals) of 

length proportional to Tjy/n (resp. 1/n), in each of which are set a fixed number of complex (resp. real) 
points based on the density v in each of these intervals. These points are denoted (Ai < • • • < A^) G 
^ (see 


and (Zi, • • • , Z;) G I 
these points is that: 


24 


25 


for the details of the construction). The important information on 


(i) . Aj are the boundaries of the intervals (remark that the A* could alternatively be fixed as quantiles 

of the distribution vr), 

(ii) . the number of points Zj is related to the mass contained in the square they are contained in, 

(iii) . the distance to the boundary of the square, as well as distances between two points, are lower 

bounded by Cj y/n for some constant C. 


Note that such a construction yields an empirical measure whose mass is smaller that one. This is 
completed by adding some points outside Ti, satisfying the same conditions of distance to boundaries 
and between points. Thus the constructed empirical distribution is close from in the sense that 
the distance (in total variation) between these two measures is arbitrarily small as soon as taking into 
account a sufficient number of points (A*) and a sufficiently fine partition of El. Around these points, 
we can define small intervals [A* — 5/2, A* + 5/2] and for sufficiently small e > 0, non-overlapping 
balls centered at Z® with radius e/n. The union of these balls is denoted D^. From this construction, 
following exactly the same algebra as in the pure real or complex case, we obtain: 
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Figure 2: The artificial empirical measure constructed for the lower bound in the ball centered at v. 
The density of v charges both El (light blue region) and a part on M (bold black line). The square 
H C El contains in its interior a mass greater (1 — a)(l — 1/n) and is partitioned into smaller squares 
of typical size of order Ijy/n. Atoms Zj are blue circles, surrounded by green balls of size e/n, and 
together form the space Dg. Along the real line, orange circles are the atoms A* and green intervals 
are of amplitude 8. 


( 1 

Q:^{B(y,8)^Ml) > SnDnexp \^{-l + ^) \Zj? + "^\og\Zi - Z^\\Z* - Z*\- l{l + fc)log(l - - 




—k ^^(|Aj| + h)^ + log I Aj — Aj I + 2 log \\i — Zj\j 

i=l i^j i=l j=l / 

> BnDn exp ^(/c + 1)^ J log |x - y\duR{x)di'R{y) - {k + 1)^ J x^dvR{x) 


1/2 


2=1 


^ / hog\x-y\diyc{x)duc{y)+ ‘^kl log\x - y\duc{x)duR{y) + R{S,e,n) 


where C denotes a constant independent of n (possibly depending on the parameters 8 and e). Bn a 
constant depending on all parameters such that log(Sn) = o{v?), Dn a constant satisfying the same 
property as in the proof of the upper bound (1/n^ log{Dn) —^ K), and R(8, e, n) a term tending to zero 
when (5 or e tend to zero, and is negligible compared to as n —)• oo (this term is obtained explicitly). 
From this expression, it is now easy to see that : 


n' 


■ ^og{Q'^{B{v, 5) n Ma)) > ^ J \x\‘^di'R{x) + j J log\x - y\duR{x)dvR{y)^ 

+ \xfdiyc{x) + J J log\x - y\di2c{x)duc{y)^ 


+ ^/ I ^os\x - y\dRcix)di 2 ciy) + o{l) 
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and therefore in the limit n —)• cx) we conclude that: 


lim lim inf 
( 5\0 n->-oo 


log(Q”'(B(iy, 6) n M2)) >K -a j \x\‘^di'R{x) + J j log - y\dvR{x)dvR{y) 

- (1 - a) j \x\‘^diycix) + (1 - a)"^ j j log \x - y\di^cix)di^ciy) 


+ 2q!(i “ y y log k - y\diycix)dRciy) 
=I[auR + (1 - a)i^c] 


which ends the proof. 


□ 


Similarly to a LDP, the above theorem readily ensures the following 


Theorem 2. The rate function X has a unique minimizer on the space Me 
empirical measures fX" G M2 converges in the Levy topology towards ya- 


The sequence of 


Proof. We start by noting that the space Ma is convex since any convex combination of elements of 
Moreover, it is a closed subset of A^f(C). The convexity of I 


Mn remains in 


24 


25 


ensures 

that the map X restricted to A4 q remains lower-semicontinuous, and has compact level sets in A4 q.. In 
particular, it implies that the map X is strictly convex on Ma, guaranteeing that there exists a unique 
minimum pLa in Ma- In order to ensure that sequences of empirical processes in M2 converge towards 
pLa exponentially fast with speed n^, we further need to ensure that these sequences are tight. That 
property is, again, a consequence of the analogous properties proved in the case of one-dimensional log- 
gases and on the the unconstrained Ginibre ensemble (seen as a consequence of the first result). 


Here, the elegant proof proposed in 25 readily extends to our constrained case and allows ensuring 


the exponential tightness of sequences of random matrices with spectral distribution constrained to 
belong to M2, as we now outline. Defining for K G [0,1] and r > 0 the compact set subset of 

M2, as: 

= {faGM2; h(B2) > K}, 

where is the complex centered ball with radius r, and = {^ G A4f (C) ; pi{B2) > K} it is easy 
to see, using the inequality log(|x — y|) < 2 -|- (|xp -|- |yp)/4, that: 


G < gn J Y\'i' = l'"dXiY\dxjdyjlAK^, exp J 


{\z\^ + \zf)dfl{z,z') 


< gn exp - 


Kn^r"^ 


where ft is the empirical measure with atoms at A* and xj ± iyj and log{gn) = 0{n^). This inequality 
readily yields the desired of exponential tightness including the constraint that empirical distributions 
belong to M2- 

We have therefore built up all necessary ingredients for proving convergence of our constrained 
ensembles. Indeed, classical Large Deviations theory show exponentially fast convergence of a sequence 
of empirical measures under the condition that the sequence is exponentially tight and satisfies a LDP 
with good rate function (the sequence converges towards the unique minimum of the rate function). 
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Here, analogous properties were proved under the constraint that each element of the sequence belongs 
to A42, and the classical proof ensures convergence of the constrained measures towards the minimum 
of the rate function on Ma- 

□ 

This method of proof of the convergence of the sequence of empirical measures /i” to fia the unique 
minimizer of X on Aia goes beyond the case of constraining the Ginibre ensemble to the number of 
real eigenvalues, and provides an account for so-called log-gas method which proved very efficient for 
the understanding of rare events in the Gaussian and Wishart /3-ensembles p^[37| or the Ginibre 
ensemble |^ . 

in our purpose to characterize Ginibre matrices with prescribed proportion of real eigenvalues, we 
therefore need to find the minimizer gia of the rate function in If this is possible, we can access 
(i) to the typical distribution of eigenvalues under our constraint and (ii) to the probability of these 
events at leading order, logarithmically equivalent to exp(—n^X[/ro,]). Our efforts will therefore now be 
devoted to the characterization of these distributions. 

3 Asymptotic behavior of 

We now use the LDP in order to characterize the asymptotic behavior of the probability of having 
asymptotically k ~ an real eigenvalues. For M" a random matrix of size n x n from the real Ginibre 
ensemble, we recall that 


= F[M^ has k real eigenvalues] = P[/i”(M) = k/n] (5) 

where is the empirical spectral distribution of M'^. To account for the discrete nature of the mass 
on the real line and for the parity of the number of complex eigenvalues, we consider the following 
quantity: 

= P[|/i"(M) - a| < 1/n] = G Ml] . (6) 

In order to characterize this quantity, it is tempting to apply directly the large deviation principle 
of Theorem 1.1] on the set A = Ma which is closed with empty interior. One therefore obtains 
the upper bound: 

limsup logPl/t^lM) G A] < — inf X[z^] = —I[pa] ■ (7) 

n^oo M u&A 

Here one faces two difficulties. First, because of finite size effects the quantity P[/i"'(M) G A\ is not 
exactly equal to P[/1”(M) G Ml]. In fact, if a is irrational logP[/i"(M) G H] = —oo for any n. Second, 
even if this problem is solved, only an upper bound holds. 

A much better estimate can be achieved using the results of theorem 

Corollary 1. For any a G (0,1), we have: 

lim ^ logpg^ = lim ^ logP[/i’^ G Ml] = -X[pa] (8) 

n^oo 77 ,^ n^oo 

Proof. In the proof of theorem we have shown that for any n € Ma and <5 > 0: 

limsup ^ logP[/l" G Ad” n H(z^, (5)] < — inf X[/r] (9) 

n—^■oo ^ 
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and 


liminf ^ logPf/t"' G H Biv, <5)1 > —1[v] (10) 

n—>-oo 

And therefore, letting <5 —>■ oo, we obtain: 

limsup ^ logP[/i"'G Af”] < — inf X[^] (11) 

n—>-00 /j-SAIq; 

liminf ^ logP[/i" G AIq] > —X[u] (12) 

n^oo Tt^ 

- ~ 

UeMa 


readily proving result announced in equation 


□ 


This result shows that the probabilities decrease as e~^ . For a = 1, we know that is 

the semi-circular law, and X[/ri] can be readily computed and is equal to log(2)/4 (it is exactly the 

difference between the constant term in the rate function for Hermitian matrices and that for non- 

2 

). This shows that log ~ ~ x ^ map (/? : N i—>■ N 


Hermitian matrices, see e.g. 


24 


25 


with (^(n)/n —?• 1. In particular, logp”_ 2 j. log 2 for any r G N. Furthermore the continuity 

and convexity properties of a i—)• X[p,a\ proven below imply the continuity and convexity of the map 
a ^ lim„^ooPSn- 


4 Properties of the minimizer jda 

We investigate the qualitative properties of the distribution the large n limit spectral distribution 
of the Ginibre ensemble with asymptotic proportion a of real eigenvalues. 


4.1 Continuity and monotonicity of a ^ pa 

In this subsection, we establish continuity of the minimizer and of the minimum of the rate function 
on the subspaces A4 q with respect to the parameter a, as well as the convexity and monotonicity of 
the minimum. 


Proposition 1. 


1. The mappings a G [0,1] i—)• pa and a G [0,1] i—)• T[pa\ o-re continuous. 

2. The mapping a G [0,1] i—>■ T\pc^ is convex and increasing. 

Proof. 


1. We define: 


Ja[p] ■ = 


T[p] if // G Ma 
- 1-00 otherwise. 


(14) 


X has a unique minimizer in A4 
use the theory of F-convergence 

(on)n>o £ [0; 1]^ Converging towards a G [0,1]. For each a 


45 


In order to show the 
Theorem 1.21 p.29]. 


continuity properties in a we will 
In detail we consider a sequence 
there exists a minimum p^n 
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map Jan- To show that fia„ —t /icn we will show that the sequence of functionals Jan T-converges 
to Ja- This amounts to proving a few inequalities on the limits of the sequence of functions Jan 
together with an equi-coerceness property of the sequence {Ja„)n- In detail, it suffices to prove 
that the sequence of processes satishes the following properties: 

(a) for any fj- £ Ai and any converging sequence t a with fj-an following inequality 

holds: 

\im inf Janil^aj > Ja[^A (15) 

(b) for any fj- G Ai, there exists a sequence «„ —)• a and i^an ^ an converging to ^ such that: 

limsup < Ja[lj\ (16) 

n^oo 


(c) (Equicoerciveness): for all n G N and for all t > 0 there exists a compact set Kt C AI^(C) 
such that {u : Ja„[i^] < i} Kt- 

We now prove that the three assumptions indeed hold in our case. 

(a) This property is a consequence of the regularity properties of good rate functions. Indeed, let 
/U G Wt be a hxed measure. If /.i ^ AIo, then for n large enough, cannot be in -Ala„, the above 


liminf is inhnite and inequality (15) is trivial. Otherwise, if /x G Afa, the above inequality is 


actually a direct consequence of the lower-semicontinuity of X : we can assume that /Xq,„ G Aia„ 
(or at least a subsequence), so that and then one uses the lower-semicontinuity 

of X to obtain 

(b) Given a measure fj- £ .Ai, the second step amounts to Ending a sequence t a and 

a sequence of measures G -Aian with —)• ^ such that the inequality (16) holds. For 

/X ^ -Aia, the inequality is trivial, hence valid for any sequence. Otherwise, can be written as 
jjL = afiR -|- (1 — a)fj,c, and for any sequence On —t a, one defines Van — + (1 — an)iJ-c- 

Then, G -Aian and JanWan] = Therefore, one obtains (16) by continuity of a i—>■ 

X[a^R -h (1 - a)/xc]. 

(c) The equicoerciveness property is a consequence of the fact that X is a good rate function, 
hence has compact level level-sets. Therefore, for any t G M, there exists a compact Kt such that 
{X < t} C Kt- We thus have {Jan < t} = {T < t} O -Man X Kt- 

These three results proved the T-convergence of Jan towards Ja, and therefore of /Xq,^ to /Xq, for 
any sequence a-n —?• a, which concludes the proof of the hrst point of the proposition. 


2. We now prove the convexity and monotony of a i—>• X[/Xo]. Let a, a' £ [0,1]. Since ^ i—)■ X[^] is 
convex, we have: 

X[t/x -|- (1 — t)v] < tX[/x] + (1 — i)X[ix] for t G [0,1], ^ £ Aia and v £ Aia' (17) 

Therefore, taking the inhmum on the above equation shows that a i—)• X[^a\ is convex. Since, 
X[/xo] < X[/xi] and X[/Xq,] > X[/xo], we conclude that the continuous mapping a i—?■ X[^a\ is 
necessarily increasing. 


□ 
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Remark 1. Since a i—)■ T[Ha\ is increasing, the minimum of the rate function conditioned on is 
equal to the minimum on A1+ = {/U G Aff (C) : /u(M) > a}: 

irif X[/r] = inf X[/i] (18) 

Me Mi 

With this continuity properties in hand, we return to the characterization of the qualitative features 
of jJ-a- 


4.2 Qualitative description of fia 


We are now interested in characterizing the distribution fia- The large deviation principle defines this 
distribution as the minimum of I, which can therefore be characterized through the computation of 
the differential of X in the space AIq,. This differential is a linear operator, acting on the space of 
signed measures h symmetrical with respect to the real axis and such that h{C) = 0, with traces 
on the real axis and on H satisfying = 0. Denoting afi^ the trace of on the 

real axis and on H (with these definitions, //^(M) = //^(BI) = 1), we find that: 


dfjl[h] ~ 2 j ~ J log \z — z'\dfi{z)dh{z') 


which can be rewritten as: 


d^Rl[fi] = J J log\x — x'Adfi^{x) + C({1 — a) J log\x — z\‘^dfi^(z)'^ h^{dx) 

= J |(1 — a)| 2 :p + a(l — a) J log [z — x|^d/U^(x) 

+il-af j {log\z - z'A + log\z - z'*A) d^J.^ {z')^ (dz) . 


Necessarily, provided that these differentials are bounded at fia, the linear behavior described by the 
above equation fails (indeed, the freedom to replace h by —h would contradict the minimality of X on 
AIq. at fia). Therefore the differential operator is equal to zero for any acceptable measure h, implying 
the following proposition: 


Proposition 2. The measure satisfies the system of integral equations defined for any x G M and 
z as 


/ log I. - xf + a(l - „) / log |x - AVM) = 0 

(1 - a)\zA + (1 - otf J (log \z - z'A + log \z - z'*^) d^/ft{z') + a(l - a) ^ log \z - x'pd^f (x') = 0. 


For the two extreme eases a G {0, 1} we reeover the eireular law fi^{x) = 0, yA' {z) = for a = 0 

and the semi-eireular law ij,^{x) = fj,sc{x) = ^V2 — = 0 for a = 1. 

These equations are generally too complex to be solved explicitly. It is however possible to charac¬ 
terize further the minimizer for intermediate values of a. 

Theorem 3. Denoting a/j,^ and the traee on M and H of the minimizer gLa of X on AIq. for 

OL G [0,1]. The measure fia has the following profile: 
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i. has a density with respect to Lebesgue’s measure on M such that g^ G This density 

vanishes outside an interval [—i2cs,i?Q,] for some constant > 0 

a. has a constant density with respect to Lebesgue’s measure on its support C HI. This 

support is a simply connected bounded open set and is such that H M = 0. 


In 46 the authors have analyzed the limiting distribution of the eigenvalues of the complex Ginibre 
ensemble conditioned on the event that a large proportion of the eigenvalues lies in an open subset of 
the complex plane. The situation we consider is similar in the sense that we have extra weight on the 
real line. While this is not treated explicitly in 46 , the argument of proof goes through with adequate 
modifications. 

This theorem provides us with a qualitative description of the minimizer ga- A more precise 
characterization of the minimizer requires the use of other techniques, namely numerical ones that are 
described below. 


5 Log-gas for Ginibre conditioned on k 

The present section introduces and analyses a log-gas whose equilibrium distribution is related to ga- 
The log-gas approach to spectral analysis of the real Ginibre ensemble presents two new challenges 
with respect to previous analysis in the literature. First, the spectra of real matrices are symmetric 
with respect to the real axis so one has to impose this symmetry in the gas. Second, due to the positive 
probability of having real eigenvalues, the log-gas is composed of two interacting phases: a 2d phase 
supported on the complex plane and a Id phase constrained to the real line for all timers. This last 
property has a very important consequence, namely that the dynamics of the whole gas depend on the 
quantity of real particles k with respect to the total number of particles n. Our method is based on 
the introduction of a mixed ld-2d log-gas whose stationary distribution is that of the spectrum of real 
Ginibre matrices with k real eigenvalues. 

This system is composed of n interacting particles in a two-phase gas: k of these particles are 
confined to the real axis and the other 21 = (n — k) particles form I pairs of complex conjugated 
particles outside the real axis. We denote the set of eigenvalues {Aj}i<i<n € C with A* = (xj,0) for 
I < i < k, \i = (xi, yi) ior k < i <k + l with yi > 0 and A* = A*_; for k +1 < i < n. 

5.1 Log-gas equations 

To derive the equations for the gas dynamics for fixed k, we identify the joint probability densities Q 
as a Boltzmann factor and compute the corresponding potential, following the method of j^. One 
obtains the following dynamics 

dXi{t) = -^dBi - Vj4>(Ai, • • • , Xn)dt (19) 

where a = \/2, Bi is a real Brownian motion with Bi = {B^ 0 ) for i < k, and Bi = {Bj^, B[) a complex 
Brownian motion for k < i < k + 1. Symmetry is ensured by imposing Bi = B*_i for k + I < i < n. 
We denote Vj = {dx^, dyf). The potential is given by 

$(Ai, • • • , A„) = E E + E (20) 

i i 
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with 


V{\i,Xj) = -log|Ai - Xj\ 


? / ? 1 

U{Xi) - ly^^o fy + {eT^c{\yi\^P^ 


( 21 ) 


We will be specially interested on the t —)• oo limit of the empirical measure 




( 22 ) 


2=1 


The system (19) is well posed: we prove below that solutions neither collide nor blow up. The 
result is actually more general and extends to particle systems within more general confining potential 
U we state: 

Theorem 4. Suppose Aj(0) for i = 1,... ,k + I are distinet and Ai(0) = Ay;(0) for k + I < i < n, 
thatU is C'^ and the maps {x,y) ^ {—xdxU{x + iy),—ydyU{x + iy)) and {x,y) AU{x,y) are upper 
bounded in by some constant 7 . The processes Aj are defined by (|19|) up to the stopping time 


T = inf{t : Aj(t) = Xj{t) for some j ^ i or yi = 0 for some i > k or Xi = 00 for some i} . 

If < 2, then P[r = 00 ] = 1; there is no collision or explosion and the particle system is defined 
for all times. 


Proof. The proof follows the one in |38|. To prove that there are no collisions we show that the drift 

(23) 


term in (19) is bounded from above. We apply Ito’s formula on to obtain 




dt 


where Aj = df. -\- df].. Developing the drift term we have 


E 

j 

E 


Vj 

^2 




O' 

2n 






- E - iVU{Xy)f - Y, ^V,V{X„Xr){VU{X,) - VU{Xr)) 

The term —(V17(Aj))^ is clearly upperbounded. Moreover, our assumptions ensure that both terms 
2 

^AU{Xj) and the last term are upperbounded. Therefore, the only term that we have to control is 


EE A.) - ^(V,V(A,,A.))' 

j rj^j 
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Because of the noise in the system, the collisions between real and complex particles as well as between 
complex non conjugated particles are impossible. On the other hand, because of the conhnement of 
some of the particles to the real line and the symmetry with respect to the real axis, collisions between 
two real particles and between a pair of complex conjugate particles are possible. Developing the 
Laplacian and the gradient squared for those interactions (real-real and complex-complex conjugate) 
one can see that 




if Aj, Xf' G 


2n2 |A,-A-P 


and hence, using the same arguments as in 38 , one can show that for < 2 particles do not collide 
before an explosion. 

Now it remains to show that there is no explosion in hnite time. Let Rt = x'j -|- = (/xf, /) 

where /(A) = |Ap/2. We have to prove that Rt < oo for all t. As shown in Rt < a.s. for all t 
where R't solves 


dR' = -\l2R'dWt + ( G + 


n 


■tuvvt 


cr^ (n — 1) 


2 n n 


-Ijdt 


for Wt a Brownian motion. is a squared Bessel process which is known not to explode so Rt does 
not explode either. 

□ 


Remark 2. The assumptions on the conhning potential U of the above theorem are clearly satished for 
U given by (21). 


The result of this theorem ensures existence and uniqueness of the particle system for all times and 
hnite n. If this system converges in time towards a non-singular stationary distribution, the limit is 
necessarily given by the joint pdf ([^ of the eigenvalues of the Ginibre ensemble under our constraints. 
In this case the empirical measure of the particle system (t) converges in distribution as t —>• oo to the 
empirical spectral measure of the Ginibre ensemble with k real eigenvalues. Therefore, simulating 
the long term dynamics of the particle system will provide an approximation of 


5.2 Numerical simulations 

In principle, it is possible to generate numerically the spectra of Ginibre matrices conditioned on k by 

drawing Ginibre matrices at random and classifying them according to the number of real eigenvalues. 

2 

However, for the events we consider, k deviates from its expected value and decrease as e“"' , so 
that matrices with large k are not readily accessible with such a strategy. As we dispose of an explicit 
form of the joint probability distribution of the eigenvalues under our constraint, standard sampling 
methods, such as the Monte Garlo algorithm described in appendix allow to access directly these 
rare events. Unfortunately, they are not efficient for large matrices. 

The same limitations at large n arise for simulations of the log-gas given that the system is stochastic 
and one needs to average over several runs to smooth out fluctuations. To overcome these, we took 
advantage of the fact that the noise term perturbing each particle vanishes. Therefore, for estimating 
n —)> oo spectral distributions, instead of stochastic simulations, we ran deterministic simulations of the 
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gas. Even though the latter do not correspond to the eigenvalues of actual matrices, their n —)• oo limit 
converges to the limit distribution of the eigenvalues, which is exactly what we seek to approximate. 
We checked convergence by increasing progressively the number of particles of the gas until the shapes 
for the complex support and the distribution of the real phase were visually indistinguishable. 

Here we summarize our key findings, which can be considered educated guesses based upon known 
facts and numerical evidence (see figure |^. The first claim is that the border of the support of each 
connected component of the complex part of fia is a smooth closed curve for all 0 < a < 1. This 
closed curve varies continuously with a. In fact, as a goes from zero to one, the support v decreases 
monotonically (in the sense of strict inclusion) from the unit semi-circle in H to the point z* = (0,y*) 
where z* is found by minimizing the potential that a single pair of complex particles would experience 
interacting with = ^sc which is given by the map z i— \z'^\ — / log \z — x\dfisc{x)- The coordinate 
y* is hence the solution of: 



Coincidentally, the support of grows monotonically from [—1,1] to [—\/2, \/2] as converges to 
the semi-circular law in [—\/2, \/2]. 




Figure 3: Left: Numerical estimation of the support of for a G {0.1,0.3, 0.5,0.7,0.9}. The crosses 
correspond to the points (0, ±y*), the dashed line is the unit circle. Right: Numerical estimation of 
for a G {0.1,0.3, 0.5, 0.7,0.9}. The dashed line is the semi-circular law. The estimations are long 
time simulations of log-gases with n = 1000. 


6 Renormalized energy and microscopic organization at zero temper¬ 
ature 


The gas provides a precise description of the minima of the potential (20) even at temperatures that 


differ from that corresponding to the eigenvalues of the Ginibre ensemble. In particular, expansion of 
the energy of the system supplies further information on the microscopic organization of the particles 
at vanishing temperature. To this end, following the work for log-gases in dimension one |41] , two (4^ 
or higher [43] , one shall compute the next-to-leading order terms of the energy. These terms correspond 
essentially to the microscopic arrangements of the particles. Since we have shown that in the regime we 
consider with k = cm, there exists a macroscopic gap between the real axis and complex eigenvalues, 
the energy related to the microscopic interactions of real onto the complex eigenvalues and reciprocal 
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forces vanish in the thermodynamic limit. Once this has been noted, a direct application of the results 
in one and two dimensions 1^,42 leads to state the following: 


Proposition 3. For large n, the equivalent energy of a Ginibre matrix conditioned on having k ~ an 
real eigenvalues enjoys the following expansion around pLa = + (1 — a)p^ the minimizer of the 

macroscopic energy: 


^n,a — ^ 


(1 + a) , . X , O, .K2 , Ki 

-nlog(nj + n (1 — aj-h a — 

2 1 27r vr J 


— n 


(1-a) / dp^\og{p^) + a I dp^logip^) + o(n) (25) 


.c 


iti 




where the K 2 and ki are universal constants related to the dimension of the spaces where and p^ 
are supported. 


From this expansion, the minimization at zero temperature leads to state that the real eigenvalues 
crystallize |^ , and to the conjecture that the complex eigenvalues organize in a regular triangular 
lattice (called Abrikosov lattice in the superconductivity domain |^), which is proved under the 
assumption that the organization is a regular lattice |42|. 


7 Conclusion 


The main conclusions of our work can be outlined as: 


1 . Despite the fact that the joint probability distribution of real Ginibre matrices is always given 
the same compact formula above (Eq.Q), the limit distributions of the empirical measure fF 
strongly depends on the number k of real eigenvalues. When kfn ^ a and n —>■ oo, we prove 
that the empirical measure has a unique limit pa that significantly departs from the circular law. 


The key method in establishing the above result is an LDP theorem devised to take into ac¬ 
count both real and complex eigenvalues. Previous LDPs discarded real eigenvalues, as in the 
unconstrained matrices, the fraction of real eigenvalues tends to zero. While we have provided 
the proof in the specific case of the real Ginibre ensemble, it can readily be adapted to other 
situations, such as gases in higher dimensions with more general confining and repulsive poten¬ 
tials such as |48] or heterogeneous gases with constraints (fraction of particles restrained to 
a subspace). It also extends to Gaussian ft and Wishart ensembles conditioned on rare events 
(such as anomalous proportions of eigenvalues in a given interval). In this sense, our approach 

These generalizations 


provides a theoretical basis for the log-gas method used in |36, 37 44 50 


allow, for instance, to analyze the impact of anomalously large numbers of real eigenvalues on 
the spectra of random asymmetric matrices with some level correlations in the entries such as 
those in |12l|28 . 


3. One of the consequences of the LDP was to provide an asymptotic for logp^ which scales as 
—n^X[pa\. To our knowledge, this is the first derivation of large n estimate for 


4. The theoretical and numerical analysis of pa established that, unlike the circular law which is 
supported by a disk, the measure pa has two distinct components. The first is supported by 
a compact set of the complex plane that is well separated from the real line, and upon which 


17 

















has uniform density of mass 1 — a. The second is supported by a segment in the real plane 
and has a density w.r.t. Lebesgue’s measure with mass a. As a increases to one, the support of 
the former shrinks to a single point and its complex conjugate whereas the latter tends to the 
semi-circular law on [—\/2, \/2]. 

5. The microscopic characterization of the particle distributions at zero temperature through the 
renormalized energy reveals that, (i) in the complex plane, particles organize in an Abrikosov 
lattice, similar to the unmixed 2d gas, yet (ii) on the real line, they are crystallized similarly to 
the Gaussian Orthogonal Ensemble in the zero temperature limit, but unlike real eigenvalues of 
unconstrained real Ginibre matrices. 

The last two points above establish that the real Ginibre ensemble constrained by fc/n —)• a in¬ 
terpolates between the circular and semi-circular law as a shifts from zero to one at the macroscopic 
level. At the microscopic level, it is a mixture of the two extreme cases. This interpolation is distinct 
from the ones in which the circular law is progressively flattened on the real line with intermediate 

It reveals some of the rich characteristics of real random 
matrices due to their spectrum containing both real and complex eigenvalues. 


elliptic like support for the spectra 12 


A Upperbound for the Large Deviations Principle 

We now sketch the proof of the upper bound of the Large Deviations estimates: 

Proof. We define I = {n — k)f 2 the number of complex eigenvalues in H. Similarly to [^, by a direct 
use of the joint pdf Q, we find that the mass of a ball B{iy, 5) n At” is: 

6 ) n AtS) = nti d\ nU dXj ULi , A.] (26) 

^ d^ri /Rfexff nLi dXt 0^=1 dXj nLi dYklB{u,S)nM" exp ^ z')dfY{z)dfY{z')^e^^/‘^ 

where 


\x^ “ 1 “ \y^ 

fuj{x,y) = min ( w, ^---log|x - y| ) , 


the measure (Y defined as: 


/ k i 

\i=i j=i 


Xi+iYi + dx3-iYj j ) 

and the coefficient is such that l/n^log(Il„) —)■ K. We hence have: 

, k i i 




HdXiYldXjUdYk 

i=l j=l k=l 


exp ( - ^ . / fu^{z,z')dix{z)dii{z'))i 

V 2 Jc^ > 


,Knl2 


allowing to conclude that: 


4 log{(3”(B(^, <5) n Xa) < 'ZYfX - 1 inf 


[ fujiz,z)dfi{z)d^i{z') + o{l). 
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The first term in the right hand side converges towards a constant. The main difference with the proof 
of |25| is the fact that the right hand side of the inequality depends on n. Here, the sequence of real 
values Ic^ non-decreasing because of the inclusion prop¬ 
erty of the spaces It is upper bounded by fc^ fojiz, z')dfj,{z)dfi{z'), and therefore 

converges towards this maximal value. This allows to conclude that: 

limsup^log(( 5 "(H(z^, (5) n At”)) < K-l- inf f f^{z, z')d^{z)djji{z') 

n—>-oo /^GB(z/,(5)nA4a 

and by continuity of the map // i—)• J ^2 fioiz, z')d^{z)d^{z') in the weak sense, we eventually have: 

lim limsup ^logP[/i” G Al"nH(zy,(i)] < A+ f f^{z, z')dv{z)du{z'). 

5\0 n^oo Jc^ 

Our result follows by letting uj to infinity and using the monotone convergence theorem. □ 

B Monte Carlo algorithm for the eigenvalues 

An efficient method to approximate numerically the minimizer /Tq and the probability distribution 
of the proportion of real eigenvalues is to use the Metropolis-Hastings Monte Carlo algorithm. This 
method consists in constructing an ergodic Markov chain whose stationary distribution is given by Q. 
Here, we evolve a n-particles system zt^ but in contrast to the log-gas, the dynamics is now discrete, 
and the transition probability is based on the pdf Q: a new configuration z* is drawn by modifying 
one of the eigenvalues at random and the Markov chain has a transition towards z* if Q'^{z*) > Q'^{zt), 
and otherwise according to a Bernoulli variable of parameter 

When conditioning on very rare events, (here for instance, a fixed number of real eigenvalues), 
cases satisfying the constraints have an extremely low probability of being explored, and more refined 
methods need to be developed in order to access these probabilities. In the present case, the problem is 
considerably simplified since we dispose of an explicit form of the distribution of the eigenvalues under 
our constraint. Indeed, the joint probability distribution of Ginibre matrices of size n constrained on 
having k real eigenvalues (A* ; i = 1 ■■■ k) (and therefore I = (n — k)/2 pairs of complex eigenvalues 
{zi,i = 1 • • • re — k)) is given by: 

( k n—k 

J|exp(-A-) exp(- 2 ;f)erfc(| 2 ;i - z*\/\/2) 

2=1 2=1 

Classical Metropolis-Hastings algorithm with Gaussian transitions preserving the nature of the system 
therefore allow to access directly the distribution of eigenvalues and the probability p(re, k) of the event 
considered. 
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